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Abstract —Consider a network of multiple independent stochas¬ 
tic linear systems where a scheduler collocated with the output 
sensors of each system arbitrates data transmissions to a corre¬ 
sponding remote controller through a shared contention-based 
communication network. While the systems are independent, 
their optimal controller design problem may generally become 
coupled, due to network contention, if the schedulers trigger 
transmissions based on state-dependent events. In this article 
we propose a class of probabilistic admissible schedulers for 
which the optimal controllers with respect to local standard 
LQG costs have the certainty equivalence property and can still 
be designed decentrally. Then, two scheduling policies within 
this class are introduced; a non-event-based and an event-based, 
where they both have an easily adjustable triggering probability 
at every time-step. The main contributions of this work are as 
follows: i) proving that for each system, the control loop with the 
event-based scheduler and its optimal controller outperforms the 
control loop with the non-event-based scheduler and its associated 
optimal controller; ii) showing that for each system, the local 
optimal state estimator for both scheduling policies follows a 
linear iteration; i) regulation of triggering probabilities of the 
schedulers by maximizing a network utility function. 

Index Terms —Shared contention-based communication net¬ 
work, Decentralized optimal LQG controller, LQ-Consistent 
event-based policy, Network utility maximization. 


I Introduction 

Event-triggered control (ETC) pertains to strategies that 
manage transmissions in a control loop based on events rather 
than time and is intended for scenarios where communication 
resources are scarce or expensive. Extensive research has 
been carried out in the past decade on ETC, mostly focusing 
on a single control loop closed by a single communication 
channel [ 1 ]—[5]. However, communication management is 
especially interesting in settings where the communication net¬ 
work is shared by multiple control loops [6]—[9], as illustrated 
in Fig. 1. Contention-free protocols such as time-division 
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Figure 1; Networked control loops through a shared communi¬ 
cation network (SCN); P describes the plant, S the scheduler, 
C the controller and z~ l a one time-step delay. 


multiple access (TDMA) [10] enable periodic transmission for 
all the control loops but lead to inefficient bandwidth usage 
when some of the control loops alternate between being active 
and inactive. In fact, under these conditions, systems operating 
through these protocols need a central coordinator for the 
resource reallocation between different users and therefore, 
they are not decentrally scalable. In turn, contention-based 
protocols such as slotted-ALOHA [10] are decentrally re- 
configurable which facilitates the design scalability. However, 
contention in these protocols could hamper the control design 
analysis when the loops transmit based on events. In fact, if 
event-based data triggering is influenced by the control inputs, 
then due to network contention, the local optimal controllers 
are in general not globally optimal anymore and a centralized 
design strategy is needed to achieve optimality of control. 

Motivated by this discussion, in this article we start by 
proposing a class of decentralized admissible schedulers for 
each control loop in a contention-based setting, within which 
the design of optimal control strategies are performed in a de¬ 
coupled fashion. This class specifies that transmissions occur 
based on a constant function of primitive random variables, i.e. 
disturbance, noise and any other possible independent random 
variable. Then a subclass of these policies is introduced 
where its triggering probability is easily tunable. Naturally, a 
non-event-based purely stochastic transmission (PST) policy 
belongs to this class. Our main contribution is to propose 
an event-triggered controller, consisting of an event-based 
scheduling policy in this class and an optimal control policy 
with respect to an average quadratic cost, that outperforms 
the PST policy also with the associated optimal control input. 
Inspired by [11], we refer to this property as LQ-consistency. 
Moreover, we propose a method to regulate the schedulers 
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based on their triggering probabilities in order to maximize a 
network utility function in the sense of providing a propor¬ 
tional fairness between the users and taking into account the 
control performance of each loop. 

The event-based scheduler proposed in this work is in¬ 
spired by the stochastic threshold event-triggered transmission 
(STETT) policy [11], [12] based on which data is triggered 
when the norm of the error between local and remote state 
estimators becomes larger than an exponentially distributed 
random threshold. We adapt this scheduling policy to the 
control loops of the contention-based communication network 
and show that the optimal control law, can still be found in a 
decentralized fashion based on local state estimators following 
the Kalman filter. 

Related work : Comparing the performance of time-triggered 
and event-triggered scheduling policies over shared communi¬ 
cation networks has received considerable attention in recent 
years [13]—[15], In [13], it is shown, via a numerical example, 
that a threshold-based event-triggered scheduling policy with 
CSMA protocol results in a better performance, measured 
by the average state variance than a time-triggered policy 
with TDMA or FDMA protocols. Moreover, [14] considers 
the average state variance performance index and compares 
its values for different communication protocols and load 
conditions. Based on that, an event-based policy over ALOHA 
protocol performs better than a time-triggered policy over 
TDMA protocol for low communication loads, however, when 
the communication load is high, the performance of the 
event-triggered policy over ALOHA protocol becomes worse. 
Moreover, [15] claims that event-triggered policies for the state 
estimation may perform worse than time-triggered policies, 
in the sense of average state error covariance, if the effect 
of the communication network is explicitly considered. Some 
researchers also consider stability of the control loops when 
transmitting through a shared communication network [16]— 
[18], In [16], the triggering probabilities of every scheduler 
transmitting through the CSMA protocol are determined in 
order to guarantee Lyapunov mean square stability (MSS). 
These triggering probabilities can be used to tune the threshold 
of event-based scheduling policies. Furthermore, an error- 
based dynamic priority allocation mechanism is employed for 
the network contention resolution of the event-triggered data 
of multiple dynamic users and then the stochastic stability of 
the overall network is proved in [17], [18], Optimal co-design 
problem of an event-triggered scheduler and a controller is 
investigated in [19], [20] as in the current work. We extend the 
result of [19] for multiple control loops closed over a shared 
contention-based network. Establishing the optimality of the 
linear certainty equivalent controller when the event-based 
scheduler just depends on the primitive random variables, is 
the main contribution of [19]. Moreover, it is shown in [20] 
that the optimal event generator for data transmission of the 
minimum-variance output feedback control is the solution of 
an optimal stopping time problem. More related work can be 
found in [21]-[26], 

Contribution: The main contributions of this article com¬ 
pared to its conference versions [27], [28] are as follows: 

• Definition of a class of scheduling policies for the users 


of a contention-based communication network within 
which the decentralized design structure of the optimal 
controller is maintained. 

• Establishment of the consistency of the proposed event- 
triggered control for time-varying triggering probabilities 
which is needed to maximize the network utility after 
every change in the number of the network users [29]. 

• Determination of the schedulers triggering probabilities 
by using the notion of network utility function. 

• Investigation of the output feedback case, as extension to 
the results in [27] where it is assumed that the perfect 
state is available for every scheduler. 

Organization: The remainder of this article is organized as 
follows: the problem of interest is introduced in Section II, 
the decentralized optimal control policy for the admissible 
schedulers is determined in Section III, the state estimator and 
the performance of the non-event-based PST is determined 
in Section IV, the novel event-based scheduling policy is 
introduced in Section V and the main results of the article are 
presented in Section VI. Moreover, we discuss the decentral¬ 
ized implementation of the proposed policies and the network 
utility maximization in Section VII. Finally, the effectiveness 
of the results is demonstrated through numerical simulations 
in Section VIII and Section IX presents some concluding 
remarks. The proofs of the lemmas and theorems can be found 
in the Appendix. 

Notation: f(x\y) denotes the conditional probability density 
function (pdf) of a random variable x given the set of informa¬ 
tion y and Af(y, Y) indicates a multi-variate Gaussian pdf with 
mean y and covariance Y. Pr(.) denotes the probability of an 
event; 5 ~ B(p) indicates that the random variable 5 follows 
a Bernoulli distribution with probability p. By g(A) and tr(,4) 
we denote the spectral radius and the trace of the square 
matrix A , respectively. Moreover, No := N U {0} in which N 
is the set of natural numbers and Z ® := {k £ Z\t < k < sj 
where Z is the set of integers. 

II Problem setting 

Consider a networked system comprised of multiple inde¬ 
pendent stochastic linear time-invariant (LTI) subsystems each 
modeled with the following discrete time dynamics 

x l +1 — AiX l k + Bi.u l k + w l k , 
y i k = C i xi + v i k 

in which x k £ K ni , u k £ R mi and y k £ R 0i are, respectively, 
the state, the control input and the output vectors at time-step 
k £ No for i £ iT™ where m is the total number of subsystems. 
Let {w l k \k £ No} and {v k \k £ No} be sequences of i.i.d. 
Gaussian random variables with zero means and covariances 
Wi = E [w k w l k ] and Vi = E[u}u} T ] for every k £ No- More¬ 
over, the pairs (Ai,Bi) and (Ai,Ci) are assumed to be 
controllable and observable, respectively. The performance 
of each system is measured by the following local average 
quadratic cost 

1 T ~ 1 

J l = lim sup — E[ x} T QiX l k + UjJRi.u k ] (2) 

T-t OO I , „ 
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in which Q, and II, are positive semi-definite and positive 
definite matrices with appropriate dimension, respectively, and 
( Ai,Q ?) is assumed to be observable for every ieZJ". 
Therefore, each dynamic user is characterized by the tuple 
(Ai, Bi, Ci , Wi, Vi, Qi, Ri), which in general and typically is 
different from the other users, i.e. the users are heterogeneous. 

As depicted in Fig. 1, we assume that the sensors of every 
subsystem are collocated with a scheduler that arbitrates data 
triggering in the control loop over the shared communication 
network. Moreover, since different subsystems are assumed 
to be physically independent, every scheduler just needs to 
transmit its locally acquired information to its corresponding 
remote controller. We also assume time-synchronization of 
the sampling process of different subsystems where they all 
have equal sampling time. Besides the performance index (2) 
for each individual control loop, one may also define a 
social index such as i J l or a network utility function 
as suggested in [29], This will be discussed in Sections IV 
and VII. 

In the remainder of this section, we introduce the operation 
mechanism of the shared communication network, the infor¬ 
mation structure of the scheduler and the controller of every 
control-loop and the main characteristics of the scheduling 
policies which admit a decentralized control design structure. 
This section will be concluded with the problem statement of 
our interest. 

II-A Assumptions on the shared communication network 

The assumptions on the shared communication network 
are very close to the ones considered in the context of the 
contention-based protocols [10] and can be summarized as 
follows: 

• Time is partitioned into fixed-size slots; during each time- 
slot only one user can transmit successfully. 

• All users are restricted to start a new transmission at the 
beginning of the time-slot. 

• Collision will occur if more than one user attempt to 
transmit which results in loss of the collided data. 

• Every user should receive a data receipt acknowledgment 
at the same time-slot if data transmission is successful, 
otherwise, it is assumed that a collision has occurred. 

Besides these assumptions, we make two additional assump¬ 
tions suited to control applications: 

• There is no retransmission mechanism for the collided 
data. 

• The transmission time is assumed to be negligible with 
respect to the duration of the time-slot, i.e., the controller 
is assumed to access data at the beginning of the time-slot 
when a successful transmission occurs. 

The first of these two assumptions is different from the stan¬ 
dard retransmission mechanism of contention-based protocols 
and is motivated by the fact that retransmissions in control 
applications would result in long delays and buffering new 
data instead of transmitting it immediately (see [14, Theo¬ 
rem 13]). Therefore, we consider that after a collision/data 
drop, the newest sensor information can be obtained and a 
new transmission with fresh data can be attempted. As we 


shall see shortly, the schedulers will be restricted such that 
this new transmission attempt occurs with a given probability 
at every time-slot, as this is the case in the retransmission 
mechanism of slotted-ALOHA, except that new data is sent 
in our setting. Moreover, implicit in the second assumption 
is the fact that we consider the time-step of the discrete-time 
system k to coincide with the time-slot index, i.e., x l k pertains 
to the state of the system at the beginning of the time-slot. 


Il l) Information structure 

In this section, we introduce the information set available 
for the scheduler and the controller at every time-step. Let 

4 11, if the scheduler i attempts data transmission at k 

1 0, otherwise 

and 

i J 1, if the network is available for the user i at k 
Pk — \ 

I 0, otherwise 

for every user i £ Z™ at every time-step k £ No- Based on 
the properties of the shared contention-based communication 
network introduced in Section II-A, we can write 

m 

pi= n a-**)- 


Moreover, let a k = p k 5 k be a variable indicating a successful 
transmission at every time-step, in which case o k = 1 and 
a k = 0, otherwise. Note that based on the structure of the 
shared communication network in Section II-A, every sched¬ 
uler receives an error-free acknowledgment signal from the 
controller whenever an attempted transmission is successful, 
and therefore it knows all the previous values of p k . Accord¬ 
ingly, in order to decide on S k the scheduler has the following 
information set at a time-step k 

n = \e G = 1A*G 4 -1 }uM}. (3) 

Let **|fc = E [*M and 4+ii k = E [4+il J fc] be the state 

estimations of the current and the next time-step computed 
by the local state estimator collocated with the scheduler. 
We know that the optimal least-square state estimator at the 
scheduler is linear and can be computed recursively by the 
Kalman filter as follows 

4+i | fc = AiX k \ k + Biu \, 

4|fc = 4|fc-i + L i(y l k - Cii\ | fc _t) 

where 

U = QiCj (CiOiCJ + V)- 1 , 

0» = + w t - A, l.,(C,+ Vi)L]Aj. 

For simplicity we assume E[(xq — x^ q ){xq — £q| 0 ) t ] = O; 
which implies E[(a:* fc - x l k ^ k ){x\ - x l k ^ k ) T ] = 0 i; Vfc £ N 0 

since (-), is the fixed point of the Kalman filter’s time-varying 
Ricatti equation. 

When the triggering condition is satisfied, i.e. Sj. = 1, the 
scheduler transmits state information over the communication 
network to be received by its corresponding controller. A 
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natural choice would be transmitting all the newly acquired 
output vectors over the time period starting from the last 
successful transmission up to the current time-step. However, 
in this case, the information set available to the remote state 
estimator is the same as the one available to the local state 
estimator. This means that the optimal remote state estimation 
in case of a successful data transmission is equal to the one 
obtained by the Kalman filter in the scheduler [30]. Therefore, 
the scheduler can transmit x' k \ k which is an equivalent and a 
more compact form to all the output values not transmitted 
so far. Accordingly, the information set available for the 
controller at every time-step k is as follows 

Jl = = 1 A £ G Z k 0 } U {a\\t G Z*}. (6) 

II-C Required characteristics for the scheduling policies 

We define next a class of admissible schedulers for which 
it will be shown (in Section III) that not only the optimal 
controller has the certainty equivalent property, but also it can 
be computed decentrally for the users of the shared contention- 
based communication network. Let us denote all independent 
random variables Xq, w l k , v' k of any control loop at all time- 
steps as the primitive random variables. 

Definition 1: (Admissible schedulers ) Any scheduler which 
depends only on the primitive random variables of its corre¬ 
sponding control loop, i.e. 

4 = 9( X b’ W 0:k-l’ V 0:k’ a 0:k) 

where the triggering law g is fixed and a[ yk represents all the 
other independent random variables of the same control loop, 
is called an admissible scheduler. □ 

Let us refer to a subclass of the admissible schedulers with 
an easily tunable triggering probability as tunable admissible 
schedulers. Moreover, assume that the transmission mecha¬ 
nism of each user of the contention-based communication 
network is based on a tunable admissible scheduler with a 
given triggering probabilities p k , Vi G Z at every time-step. 
Then from a single control loop perspective, the contention- 
based communication network can be abstracted as if at every 
time-step there is a probability 

m 

Qi= n u-K) (7) 

that all the other users are not trying to transmit and the 
network is therefore available. Hence, at every time-step, 
the control loop of interest has a successful transmission 
probability of 

4 = 44 (8) 

which directly affects stability and the performance charac¬ 
teristics of that control loop, and also the overall networked 
system. 

II-D Problem statement 

In this section, we first introduce some concepts and then 
state the problem to be tackled in this article. When every tun¬ 
able admissible scheduler of the network uses the information 


set defined in (3) for deciding on data transmission, then we 
call the scheduling policy an event-triggered or event-based. 
Otherwise, i.e., if the tunable admissible scheduler operates in 
a non-event-based fashion with a given triggering probability 
at every time-step, then transmissions happen purely stochas¬ 
tically as defined next: 

Definition 2: (Purely stochastic policy ) For any control loop 
of the contention-based communication network, a non-event- 
based tunable admissible scheduler operates by triggering data 
purely stochastically, i.e. 

4’ ps ~B(4), (9) 

where p k is a given triggering probability at every time-step 
k G No- This transmission mechanism is referred to as purely 
stochastic transmission (PST) policy. □ 

The PST is a suitable non-event-based tunable admissi¬ 
ble scheduler for transmitting through the contention-based 
networks. We will show in the sequel (see Theorem 1 and 
Lemma 1) that given the PST scheduling policy for any 
control loop of the network one can compute its optimal 
control law analytically. Now inspired by [31], let us define 
the following LQ-consistency property for any possible event- 
triggered control policies: 

Definition 3: (LQ-Consistency) An event-triggered control 
policy of any control loop is called LQ-consistent if it results 
in a better performance, measured by the average quadratic 
cost (2), than that of the control loop with the PST policy and 
its associated optimal controller. □ 

Then the problem we are interested in can be stated as 
follows: Find an LQ-consistent decentralized event-triggered 
control policy suitable for the users of the contention-based 
communication network. 

Ill Decentralized optimal control design 

In this section, we establish that when the schedulers of the 
contention-based network users are admissible according to 
Definition 1, then the optimal control design problem has the 
certainty equivalence property and is equivalent to the design 
of several locally optimal controllers. We need the following 
assumption in future analysis. 

Assumption 1: All the schedulers of the control loops 
of the shared contention-based communication network are 
admissible according to Definitions 1. □ 

Theorem 1: Let Assumption 1 hold and the control signal u\ 
is generated according to the following state feedback law 

4 = K i E l4\Jk) do) 

for every i G Z\" where 

Ki = -(BJ Pll, + Ri)~ 1 BjPiAi, 

Pi = AJP^ + Qi- KJ(.BJPiBi + R t )Ki. 

Then the control policy (10) is optimal if the loop is MSS, 
i.e., sup{E[xj[.a;]j. T ]} < c for a c G [0, oo) and k G No- □ 
Proof of Theorem 1 : Let us consider the whole networked 
control system centrally and denote J k = U 'iL\J k as the total 
information available for the hypothetical central controller at 
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every time-step. First of all, we have to establish the certainty 
equivalence property, i.e., that the control law takes the form 

4 = K,E[x l k \J k ], (12) 


where due to physical independency between different dy¬ 
namic users, the optimal control gain for every system can 
be determined independently, as given in (11). As explained 
in [32], we can conclude certainty equivalence property if we 
show the following two features: 

• Independency of the scheduling law from the control 
inputs which prevents the generation of the control inputs’ 
dual effect in the control loops. 

• Independency of the remote state estimation error from 
the control inputs. 

Since every Sj, follows a constant function of the primitive 
random variables, any o k = S\ Iljli jjtiO- — S 3 k ) is indepen¬ 
dent from all control inputs which indicates that the first 
required property holds. Moreover, following the same steps 
as in [30], we can prove that the state estimation errors, i.e. 
e k\k = 4 ~ ^[41^4 f° r b°th forced an d unforced dynamics 
with the same realization of the primitive random variables are 
equal, therefore, the state estimation error in the controller is 
independent from the control inputs. Accordingly, the optimal 
control law for every system follows (12). 

Now we have to show that ^[x\\Jk] = E[4l4c] holds for 
all dynamic users at every time-step. Based on the Bayes law 
of conditional probability, we have the following equality at 
all time-steps and for all dynamic users 


/(4 




jj\ _ 1^4 1 x k ) ,, i I jis 

k) ~ PrtU^JM) 


where it is clear that the fraction term in the above equa¬ 
tion is equal to one and /(4l4> 4fc) = f^ x k\^k) 

which concludes the statement. Therefore, the optimal local 
controllers are equivalent to the optimal central controller. ■ 
Based on Theorem 1, the optimal control and the remote 
state estimation problems admit a decentralized structure when 
all the schedulers are admissible according to Definition 1. 
Accordingly, we can analyze the controller of every closed- 
loop system independently from the others as depicted in 
Fig. 2. Furthermore, in Sections IV and V, we establish that 
for the PST and an event-based transmission policies the state 
estimation required in (10) is computed by a local Kalman 
filter. 



Figure 2: A decoupled control loop of the contention-based 
shared communication network (SCN). 


IV Purely stochastic transmission policy 

In this section, we analyze every control loop of the shared 
contention-based communication network where its scheduler 
is operating based on the PST policy introduced in Defini¬ 
tion 2. First, it will be shown that given Assumption 1, the 
remote state estimation of this control loop follows a linear 
iteration (Kalman filter). Second, the closed-form expression 
of the local average quadratic performance is determined when 
all the schedulers have constant triggering probabilities at 
every time-step. 

Lemma 1: Consider that the Assumption 1 holds, then for 
every control loop with the scheduler operating based on the 
PST policy (9), we have 

4+11 fc = A i x l\k + Bi u h 
-i f4| if 4 = 1 > ( 13 > 

\4|fc_i> otherwise, 

at every time-step where a;L fc = is the remote state 

estimation needed for the calculation of the optimal control 
policy in (10). □ 

The proof of Lemma 1 is available in the Appendix. 

The MSS and the performance of the control loops of 
the shared contention-based communication network depend 
highly on the data transmission probabilities. The following 
lemma provides the MSS condition and the optimal perfor¬ 
mance of every subsystem when the scheduler is operating 
based on the PST policy. 

Lemma 2: Consider that the Assumption 1 holds and all 
the schedulers are transmitting with constant probabilities at 
every time-step, i.e. p l k = p l , Mi £ Z ™ and at all k £ No- 
Then every control loop with the PST policy is MSS if 

g(V l - gV4) < 1 (14) 


where q‘ is determined by (7), and the optimal average 
quadratic performance of subsystem i is 

OO 

r ps = tiiPiWi) + £(i - qyy+MMWiAi'Yi) 

3= 0 

OO 

+ ^ g y(l-gy)M40 i Afy i ) 

3=0 

(15) 

where Yi = Kj(BjPiBi + Ri)Ki , and 0^ is given in (5). □ 
The proof of Lemma 2 is available in the Appendix. 

A key question is how to find the optimal values of the trig¬ 
gering probabilities in order to minimize a social performance 
index. The natural answer might be to consider the following 
optimization problem 

m 

(y,...,yy =arg min (16) 

p ’ ■’ pm 

However, (16) is in general a non-convex and non-separable 
and therefore, not easily solvable. In Section VII, we determine 
the constant triggering probabilities of the schedulers based 
on a network utility maximization criterion which is more 
tractable. 
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V Proposed scheduling policy using stochastic 

THRESHOLDS 

The novel tunable admissible scheduler proposed in this 
work combines the features of the STETT [11] and the PST 
policies. In this section, we first introduce the STETT policy 
and discuss its advantages. Then, we propose the combined 
scheduling policy which is in the class of tunable admissible 
schedulers. 


V-A Stochastic Threshold Event-triggered Transmission 
Inspired by [11], the data triggering mechanism of the 
STETT policy for every linear system with Gaussian distur¬ 
bance and noise is defined as follows 

- r i, — 1 

. e;,, 

(17) 


ci.st 
s k = 


if y T 


2 w fe | fe —1 — l c fc | fc —1 ^ ' fc > 


otherwise, 


in which r k ~ exp(A{) for A} G K>o is an exponentially 
distributed random threshold, eL fc _i = x^ k — x\\ k _i and 
e k\k = ^k\k ~ ^k\k are t ^ le predicted and the updated state 
estimation errors between the local and the remote state esti¬ 
mations with the covariances as i & l k \ k _ l = E[e ^ k -i e k\k-M 
and %\k = E I4| k e k\ k \X k ], respectively. At every time-step, 
the value of e k , k _ 1 can be determined by subtracting the 
updated state estimation by the Kalman filter (4) and the 
predicted state estimation as given in (13). 

Unlike deterministic threshold event-triggered transmission 
policies, STETT policy keeps eU fc _ 1 Gaussian distributed at 
all time-steps when there is no data collision or drop-out [12], 
However, as we shall see shortly, although after a successful 
transmission e l k \ k _ 1 is Gaussian distributed until the first new 
attempt to transmit, in case of data collision in the current 
transmission attempt, the distribution of the state error will 
become the sum of two Gaussians at the following time-step. 

To clarify this statement, note that the state estimation error 
used in the scheduling law (17) has the following dynamics 


e fc+l|fc — 4 a k)^i e k\k + -^i(C'i4+l|/c + V k + 1) (^) 

where e\. +1 \ k = x\ — x l k \ k _ 1 is the state estimation error of 
the Kalman filter which has a Gaussian distribution with zero 
mean and 0, as its covariance matrix for all time-steps. It 
can be shown that the state estimation error dynamics in (18) 
depends only on the following information set at every time- 
step fc, 

n = R R rl p\\t G 4" 1 } U R} U {4} 


which indicates that the SETT policy (17) is a function of the 
primitive random variables and therefore, can be categorized as 
an admissible scheduler according to the Definition 1. Shortly 
we will see that the triggering probability of the STETT policy 
can be easily tuned when there is no data collision or loss in 
the communication network. 

Let 4—i = L then e l fc|fc _ 1 = + 4)> which is 

clearly Gaussian. Therefore, assume that the distribution of the 
predicted state error is Gaussian at time-step k as it is the case 
if 4-i — L but with an arbitrary covariance ’4| fc _ 1 , i.e. 

/(4| fc -il4)=R0,n |fe _ 1 ). (19) 


Then the next lemma shows that the pdf of the predicted 
state estimation error at the following time-step, i.e., k + 1, 
is Gaussian in case of no data triggering {S k st = 0) and in 
case of a data triggering and collision (6 k st = 1 A p l k = 0) the 
state estimation error becomes sum of two Gaussians. 

Lemma 3: Assume that the distribution of the predicted state 
error follows (19) at time-step k, then 

Pi = PR’** = 114) = 1 - (1 + A£)-V (20) 


is the probability of the data transmission by the STETT 
scheduler (17) at time-step k in which n, is the dimension 
of the state vector. Moreover, 


/(<W*k = o,n)=m,n +llk ) (2D 


and 


M+i\kW =1 .4 = R 

= \k) 

Pk Pk 


( 22 ) 


are the pdfs of the predicted state estimation error at time- 
step k + 1 in case of no data triggering (S k st = 0) and data 
collision ( S k st = 1 A p' k = 0), respectively, where 


A]% lk _ 1 A i + $ l 




k+l\k 


I —A]¥ k \ k _ 1 A l + <t> l , 


(23) 


□ 


in which <f>i = A\ QiAi — O; + W t . 

The proof of Lemma 3 is available in the Appendix. 

According to Lemma 3, the distribution of the state esti¬ 
mation error (e^fe-i) remains Gaussian over the time period 
between the last successful transmission and the first subse¬ 
quent data collision. Moreover, within this time interval, based 
on (20) the triggering probability depends only on the random 
threshold parameter A], and thetefore, we can easily regulate 
it as follows 


A* = 1 — (1 — Pk)" 


(24) 


However, when a collision happens, the distribution of the state 
error becomes the sum of two Gaussians. More specifically, it 
can be shown that in between every two successive successful 
transmissions, every collision doubles the number of Gaussian 
terms of the state error pdf [28]. When the number of Gaussian 
terms is more than one, the triggering probability depends not 
only on the threshold parameter but also on the covariances 
of the multiple Gaussian terms. Therefore, it is not trivial 
to regulate the triggering probability desirably after the first 
collision instance. This motivates the proposed scheduling 
policy discussed in the next section. 


V-B Combined Event-triggered Transmission Policy 

In this section, we propose a combined event-triggered 
transmission (CETT) policy n 1 = p\, p l 2 ,...), where 

p k :I k -> {0,1} and S k M = p k (l k ), which inherits the ad¬ 
vantages of the STETT policy and is in the class of tun¬ 
able admissible schedulers. Based on this policy, after every 
successful transmission, the scheduler triggers based on the 
STETT policy with any desired probability p k up to the 



7 


time-step at which the first collision happens. After that, the 
scheduler keeps triggering based on the PST policy with the 
desired probability p\ until the next successful transmission 
time. This process is repeated between every two successive 
successful transmissions. 

Definition 4: ( CETT ) Let i\ \= max{f < k\a\ = 1} be the 
time of the last successful transmission before the current time- 
step k. Then, we can specify the CETT policy as follows 

| 's z k ’ st , iffc=4 + i 

6^=1 or if (^ 1 ,...,^) = (0,...,0) (25) 

otherwise, 

where S k si follows (17) with A} determined by (24) for a 
given p l k as the triggering probability and S k ps ~ B(p}) for 
all time-steps. □ 

In Fig. 3, different admissible schedulers introduced in this 
work are categorized based on the tunability of their triggering 
probability. We know that the PST policy is always easily 
tunnable. However, the STETT policy is only easily tunnable 
after every successful data transmission time-step up to the 
next data triggering time-step. Therefore, we can say it is 
partially in the class of the tunable admissible schedulers. The 
CETT policy in (25) is proposed to follow the SETT policy 
as long as it is easily tunable, otherwise, it switches to follow 
the PST policy. The hatched region in the figure indicates the 
CETT policy. 


Admissible schedulers 



Figure 3: Relations between different admissible schedulers 
for the contention-based network. 


VI Main results 

In this section, we state the main results of the article. 
In the beginning, we introduce the optimal control policy 
associated with the CETT policy (25) in Theorem 2. Then we 
define the combined event-triggered control (CETC) policy in 
Definition 5. Finally, we establish the consistency property in 
Theorem 3 by showing that the CETC policy outperforms the 
PST policy when performance is measured by the average of 
local LQG cost functions. 

Theorem 2: Let Assumption 1 hold and for a control loop 
the scheduler follow the CETT policy with a given set of 
triggering probabilities P* = {p\\k £ No} and the controller 
follow (10) with the LQG gain in (11) and the linear state 
estimation in (13). This control policy is optimal if the loop 
is MSS, i.e., sup {E[a:} a;} 1 ]} < c for a c £ [0, oo) and k £ No- 

□ 

The proof of Theorem 2 is available in the Appendix. As 
it can be seen in the proof of Theorem 2, the pdf of the 
remote state estimation associated with the CETT policy (25) 


is Gaussian at every successful transmission time-step. How¬ 
ever, it follows a sum of Gaussians where the number of 
Gaussian terms is two at the first time-step after a successful 
transmission and is increased by two every time-step elapses 
with no successful data transmission, i.e., 

f 1, if = 1 

4 = < 2, if 1 A f k = k - 1 

I k\_ 1 + 2, otherwise. 

where k\ is the number of Gaussian terms of the remote state 
estimation error pdf at time-step k. It is worth noting that, these 
Gaussian terms all have the same mean values determined 
according to (13). 

Definition 5: {CETC) The combination of the CETT pol¬ 
icy (25) and its corresponding optimal control law (10), (11) 
and (13) is denoted by the CETC policy. □ 

We state next the main result of the article. 

Theorem 3: Let Assumption 1 hold and a control loop 
be MSS when its scheduler follows the PST policy for a 
given set of triggering probabilities P* = {//[.\k £ No} and its 
controller follows the optimal policy (10) determined by (11) 
and (13). Then, the average quadratic performance (2) of this 
control loop when its scheduler-controller is operating based 
on the CETC policy strictly outperforms the optimal control 
performance when the scheduler is operating based on the PST 
policy, i.e., 

V < V 

d -K ^ d pS 

when the set of triggering probabilities for both scheduling 
policies follows P\ □ 

The proof of Theorem 3 is available in the Appendix. 

Remark 1: Based on Theorem 3, the MSS of each control 
loop with the scheduler following the PST policy for a 
given set of triggering probabilities P* = {p\\k £ No} and its 
associated optimal controller can also guarantee the MSS when 
the control loop is operating based on the CETC policy. 

VII Decentralized Implementation and Network 

UTILITY MAXIMIZATION 

From the discussions so far in this article, it can be 
concluded that all the schedulers and the control policies only 
require local information and therefore, can be implemented 
in a decentralized fashion. In this section, we first discuss 
how to regulate the schedulers to optimize a social criterion 
by determining the triggering probabilities, and then we show 
how to implement this optimization decentrally. 

Determining the triggering probabilities of the schedulers 
based on the total average quadratic performance as in (16), 
results in a non-convex and a non-separable optimization 
problem, which is difficult to solve, especially in a distributed 
way. Therefore, we take a notion of network utility from [29] 
which considers a weighted proportional fairness between the 
network users. Based on that, for every user, we assume a 
constant triggering probability at every time-step (therefore, 
we drop its time index k for simplicity) and assign to it a 
utility allocation function as follows 

EPfa 4 ) = cMogfa*) 


(26) 
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where rf defined in (8) is the constant successful transmission 
probability and the constant c 1 £ R>o determines the trans¬ 
mission priority of every user which can be selected based on 
the average quadratic performance of every control loop (15). 
Then the optimal successful transmission probabilities are 
determined as follows 

m 

(t? 1 *,..., ? 7 m *) = arg max (27) 

jj 1 ,...,!)™ ' 

3 = i 

The following lemma, proposes the solution of this problem. 

Lemma 4: The solution of (27) results in 

p l * = (28) 

for every i £ Z\" as the optimal triggering probability of the 
schedulers of the contention-based communication network 

introduced in Section II-A. □ 

The proof of Lemma 4 is available in the Appendix. 
Moreover, we propose the following methods for the selection 
of c\ i£Z?: 

(i) The coefficient c 1 for every i £ Z™ can be selected based 
on the parameters which affect the average quadratic 
performance of the corresponding system. According to 
Lemma 2, a natural choice is 

c* = aMAiWiAjYj + (1 - aiMAiQiAjYi) 

where cn £ [0,1] can be selected arbitrarily. 

(ii) We can also select all transmission priorities to be 
equal which results in p l * = 1/m for all i £ Z\ n . This 
triggering probability is equal to the optimal triggering 
probability for maximizing the throughput of the slotted- 
ALOHA communication channel [10]. 

Both strategies for tuning the triggering probabilities can be 
implemented in a decentralized fashion, as long as every node 
in the network has access to c l of all nodes. This parameter 
can be either broadcasted at every time-step the node is active 
or just broadcasted once when a node joins the network (in a 
successful transmission), and when a node leaves the network 
a terminal message should also be sent such that each active 
node is informed which node has left the network. 

VIII Numerical simulations 

In this section, we illustrate via a numerical example that 
the proposed CETC policy performs indeed better in com¬ 
parison to the PST policy and we assert the performance 
gains. Consider a scalar LTI subsystem with A = 0.9, B = 1, 
C = 1.5, D = 0, W = 1 and V = 1.5. Due to the decentral¬ 
ized structure of the proposed policies, we can consider just 
a single control loop of this networked control system and 
drop the index i of the control loop parameters. Moreover, 
let Q=1 and f? = 0.1 be the parameters of the average 
quadratic performance. Then, the state feedback controller and 
the Kalman filter gains are K = —0.8233 and L = 0.4476, 
respectively. We consider a constant triggering probability for 
the scheduler of this control loop at all time-steps. In Fig. 4, we 
compare the average quadratic performance of both policies 
for two different constant probabilities that the network is 



Figure 4: Average quadratic performance comparison between 
the PST and the CETC policies 



0 0.2 0.4 0.6 0.8 1 


Triggering Probability (p) 

Figure 5: Average quadratic performance gain percentage of 
the CETC policy in comparison with the PST policy 

free q £ {0.50,1.00} for this control loop at all time-steps. 
These plots illustrate what we have observed in Monte-Carlo 
simulations. For each pair of (p, q ), we consider v.mc = 10 
as the number of Monte-Carlo runs where for each of them, 
T = 100000 is the total number of simulation time-steps. The 
initial state for all simulations is assumed to be zero, i.e. 
.x'o = 0. Fig. 5 shows the percentage of the performance gains 
of the CETC policy with respect to the purely stochastic policy, 
i.e. %(J ps — J n )/Jp S - As can be seen, when the availability 
probability of the network (q) for a specific control loop is 
higher, the performance gain obtained by the CETC policy is 
also higher. 

IX Conclusion 

This work considers multiple independent linear systems 
communicating through a shared contention-based commu¬ 
nication network with their local remote controllers. We 
introduce a class of admissible schedulers which provides 
a decoupled optimal control design structure for the users 
of the shared contention-based communication network and 
is proved to have the certainty equivalence property. Then, 
two scheduling policies in this class of admissible schedulers 
are introduced, a non-event-based and an event-based policies 
where their triggering probabilities are easily tunable at every 
time-step. This feature can be used for maximizing the utility 
of the network in the sense of providing a proportional fairness 
between the users of the network. It is proved that for both 
of these two scheduling policies, the local optimal control law 
is determined based on a Kalman filter state estimator. The 
main contribution of this article is the LQ-consistency of the 
proposed event-based control strategy, i.e. for any system, the 
loop with the event-based scheduler and its optimal control 
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law outperforms the non-event-based scheduling policy with 
its associated optimal control law, as the triggering probability 
of both scheduling policies are the same at every time-step. 


Appendix 


A Lemma 1 


We assume that the local estimator is aware of the control 
law and since 3 k _- t C I k , then the local estimator is aware of 
all the previous control input values, i.e. {u l t \t < k} at every 
time-step k. As discussed before, the information set of the 
local and the remote estimator is equivalent at triggering time- 
steps, therefore, x l k > k = x' f , k when a k = 1. However, when 
a k = 0, then the remote state estimation pdf is as follows 


f(4ULi,4 = o) 


Pt(4 = 0|^-d 4) 
PrK = 01JU) 


fK\JLi)- 


Since the unsuccessful transmission probability is in¬ 
dependent from x' k , the fraction term in the right 
hand side of the above equation is equal to one and 
.f( x l\Jk_ 1 ,cr , k =0) = f{ x k \J k _ 1 ) which indicates that in 
case of unsuccessful transmission, it is just needed to perform 
the prediction stage of the Kalman filter, i.e. x l k > k = x k ^ k _ 1 
where x k+1 ^ k = A.iX k ^ k + BiU k . Therefore, the conditional 
state expectation in the remote estimator is proved to be given 
by (13). 


B Lemma 2 

We can represent the Kalman filter as 


x l k +i\ k +i — AiX k \ k + B t u\ + Li(y l k+1 - Cj^fe +1 |fc) (A.l) 

and the remote state estimator as 

%k+l\k = + (1 — a k)AiX\] [k _ 1 + BiU\. (A.2) 

By subtracting (A.2) from (A.l), the dynamics of the state 
estimation error (e t ^ k _ 1 = x\^ k - x^ k _ t ) is as given in (18) 

for 4|k-i = 4 - 4|k-i and 4|fc = 4 - 4 |k where 

efc+i|fc = A t (I - L i C i )e k \ k _ 1 - AiLiV k + w l k , 
efc+i|fc+i = Aie ’ k | fc + w k - L i {C i e' l k+1 | fe + v k+1 ). 

We know that when a k = 1, then the 

updated remote state estimation error is 

4i* = 4-4|fc = 4|fc- However - when 4 = °. then 
4i* = 4 - 4|*, + 4[*, - 4|fc-t = 4ifc + e i\k~v Usin s 

(18) and (A.3), e k \ k will have the following dynamics 

efc+i|fo+i = Aie \| fe + w k (A.4) 

Therefore, we can express the updated remote state estimation 
error dynamics as follows 


e fc+i|fc+i — 


i A A\ k + w l 

\ A i S i\k + w l 


if 4 = 1 

otherwise. 


(A.5) 


Denoting by ^ l k \ k = K[e k \ k e k \ k \J k ] the covariance of the 
updated remote state estimation error at every time-step k , 


based on (A.5), we can conclude that it has the following 
dynamics 

$fc + l|fc + l = (1 — 4 +l)(-^I^fc|fc-^» + fP*) + 

From the fact that <J l k+1 is independent from we obtain 

mi+iik+i] = (1 - q'VKAjEi^A., + Wi) + q ye t 

Moreover, letting ¥ = limsup^^ E[$T fc ], we find 

¥ = (1 - qYXAj&Ai + Wi) + gV0i. 

This equation has the following closed form solution 

OO 

¥ = X)(1-9V) j ‘ +1 (4 T WiAD+qYil-qYyW G^) 

j =o 

which is bounded if o(\J 1 — (fp'd,) < 1 and holds by as¬ 
sumption of the lemma. On the other hand, the average 
quadratic performance (2) is given by [33, Ch. 5] as follows 

1 T_1 

r = tr Wi) + lim sup - ]T tr^E^]) (A.6) 

which can be expressed as 

J l = tr(PjWj) + tr(Fj limsupE[4 |fc ]) = tr (P t W,) + tr^ 1 ). 

k—y oo 

By substituting the solution of ¥ in the above equation, we 
arrive at the closed form of the average control performance 
for the PST policy in (15). 


C Lemma 3 


We consider both situations separately and drop the index i 
for simplicity. 

1) S k = 0: based on the Bayes law of conditional proba¬ 
bility we have 

list n T \ _ Pr (^fc 4 = 0|e fc | fc _i,X fc ) 

I3k\k\o k - 0 ,l k ) - -p r (^( — - .t{e k \ k -i\lk)- 

' (A.7) 

Denote 2 = e k \ k ~\, then based on the triggering policy (17) 
and considering rg = \_ 1 z we have 


Pr(^ 4 


and 


0\z,X k ) = j\ k e~ Xkr dr = e 2 zTAfc (A8) 

ro 


Pr (<5f = 0| l k ) = 


(X k e~ Xkr )e 

det(27rT' fc | fc _i)5 


drdz 



zgR" 


g U T (l + A fc )'I' fc | fc _iZ ^ 

---—i —dz = (1 + Afc) 2 . 

det(27rT' fc | fc _ 1 )2 


(A.9) 

Finally, by substituting (19), (A.8) and (A.9) into (A.7) we get 


f{e k \ k \8 s k 


0 , 2 k ) 


e -iz T (H-A fc )3' fc| 1 fc _iZ 


(1 + Afc) 2 det(27r\I/ fc | fc _ 1 ) 2 


Af(0, 


v I f fe|fc-i 

1 + A k 
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Therefore, when 8 k = 0 the pdf of the updated state es¬ 
timation error ek\k will remain Gaussian. Moreover, since 
v k +i = L(Ce.k+ i|fc + Vk+i) in the dynamics of predicted state 
estimation error (18) is Gaussian, then the predicted state 
estimation error at the next time-step is also Gaussian as given 
in (21) where $ = A J QA — 0 + W = E[i/fc+ii/£ +1 ] for all k. 

2) Sff = 1, Pk = 0: in this case, the controller does not 
receive Xk, therefore 


f(e k \ k \5? = l, Pk = 0,l k ) = 
p r(<5f = l|e fe | fe _i ,p k = 0 ,l k ) 
Pr(<5f = l\ Pk = 0,l k ) 


f (ek|k_i 


By using (A.8) and (A.9) we get 


Pr(<5f = 1| z,p k = 0,l k ) = 1 - 


(A. 10) 


(A.ll) 


and p k = Pr(<5f = l|p fc = 0,l k ) = 1 - (1 + A fc ); £. Then by 
substitution into (A. 10) we get the following 


— — Z T 1 

1 e 2 k\k-l 

f( e k\k\8 k = 1 ,Pk = 0 ,Xk) =—( 


Pk det(27rT ' fc | fc _ 1 )2 

1 > 

det(27rT' fc | fc _ 1 )5 


Therefore, in case of a collision the pdf of the updated state 
estimation error will become the sum of two Gaussians 


/(efc|kl<5f 


1; Pk 


0 ,1k) 


1 

Pk 


N (0) ’Pfc|fc-i) 


1 - Pk 
Pk 


U(0, 


^fclfc-l, 

1 + Afc 


Again with the same conclusion as the one presented for the 
case when 8 k = 0, the predicted state estimation error at the 
next time-step will be the sum of two Gaussians where their 
covariances follow (23). 


D Theorem 2 

First of all, we have to show that the CETT or equivalently 
STETT policy is in the class of admissible schedulers. The 
Kalman filter by the scheduler side and the remote state 
estimator follow (A.l) and (A.2), respectively. By subtract¬ 
ing (A.2) from (A.l), the dynamics of the state estima¬ 
tion error used by the scheduling policy (17) is determined 
by (18). Then, the scheduling law (17) can be represented 
as 8\ = g(lZ k ) where g{.) is an appropriate fixed function 
and = {v l t , w\, r\, p\\t G Z^ 1 } U {v{} U {4} is a set of 
independent primitive random variables. Therefore, the CETT 
policy is in the class of admissible schedulers and the certainty 
equivalent control is optimal based on Theorem 1. 

Now we have to find the estimated state in the controller 
for which we follow an induction arrangement. For the sake 
of simplicity, we consider a single control loop and drop 
the index i. Without loss of generality, let us assume that at 
k = 0, (To = 1 and find the state estimation at k = 1 assuming 
o i = 0. Then, f(x o|cr 0 = 1, x 0 |o) = A/"(£o|o> 0) which can be 
concluded based on the properties of the Kalman filter. Since 


the remote state estimator is aware of the control inputs at all 
time-steps, the pdf of the predicted state at k = 1 is 


/Oi|cr 0 = l,x 0 |o) = A/'(xi|o,ri| 0 ) (A.12) 


where Xi|o = Ax 0 |o + Bu 0 and Tqg = A T QA + W. Then the 
updated pdf of the remote state estimation at k = 1 if o\ = 0 
is determined by using Bayes law of conditional probability 
as follows 

f(xiW o = 1, cri = 0, tc 0 |o) 

_ Pr(or = 0|q-o = 1,4|o? xi)f(xi\<7p = 1,Xq|o) 

Pr(tJi = 0|cr o = l,x 0 |o) 

(A.13) 

Moreover, we have 


Pr((7i = 01 cr 0 = l,s 0 | 0 ,xi) = Pr(<5i = 0|cr 0 = l,x 0 | 0 ,xi) 

+ Pr(pi = 0)Pr((5i = l|cr 0 = 1, x 0 i 0 , Xi). 

(A.14) 

Let z\ = x\ — Xi|i, Z\ = x\ — Xi|o and Z\ = Xi|i — Xi|o 
where f[z\) = A/"(0, 0) and ¥,[zizl\oo = 1] = t^iio for 
vp 1|0 = <f> = A J QA — 0 + W. Then 

Pr(<5i = 01 ctq = 1, x 0 | 0 , xi) = Pr((5i = 0|cr o = 1, Xi|o, xi) = 

OO , 

r r Aie _Air e _ 2 £ i e *i 

Pr(<5i = 0 |ct o = Mi) = J J 


z iGM 71 r 0 , 


det(27T0) 2 


-drdz-i 


o -^(z 1 -z 1 y \'i/ 1 . 1 0 (z 1 -z 1 )-^zie 1 z i 


21 61 


det(27T0) 2 


-dz\ 


for r 0 = = Ha - - h). We can 

simplify the above equation by using the following equality 

{z\ - ii) T AiT'^ | g(^i - z-i) + zj&~ 1 z i 
= (a - ^i) T (Ai^^ + 0 _1 )(ai - z[) + 
where z' x = (AiT'^jp + 0“ 1 ) _1 Ai\l/^gZi and 

n i|o = (Ai^qJ - Ai^J(Ai^J + 0 _1 }" 1 Ai5'[| ( l ) ) 

= 7“ ^1|0 + ©• 

M 

Then 

Pr(<5i = 0|cr o = 1) *o|o> x i) = £ie -5Zini i° Zl (A.15) 
where ^ = l/( det(Ai\E'^jJ + 0” 1 ) det(0)). Moreover, 

Pr((5i = 1 |(t 0 = l,x 0 | 0 ,xi) = 1 - ^ie _5ZlIIl i oZl . (A.16) 


First substitute (A.15) and (A.16) into (A.14) which results in 

Pr(oi = 0|cr o = 1, x 0 | 0) xi) = 1 - qi + qi£ie~i Zl ni i° Zl 

where q\ = Pr(pi = 1), then substitute the result into (A.13) 
which results in the following 


f(x iko 


= 1,0!= 0, f 0 | 0 ) 

1 - qi + 9iCi e_!2 ^ ni| ° Zl 




1 — QiPi det(27rr 1 | 0 ) 2 


2 

= ^/3'AA(x 1 | 0 ,ri |1 ) 

i =1 


(A.17) 



11 


where )T)“ =1 (3[ = 1. As it can be seen, at the first time-step 
after the successful transmission the updated state estimation 
pdf is the sum of two Gaussian terms with different covari¬ 
ances. The total covariance of the estimation is affected by qi 
which is the probability that the network is available. However, 
the mean values of the Gaussian terms are equal to the one 
obtained at the prediction stage (A. 12) and do not depend 
on qi, i.e. xui = E[xi|<to = l,cri = 0, #o|o] = a?i|o- Now as¬ 
sume <7fc = 0, Vfc £ Z[ +1 , then the updated state estimation 
at k = t + 1 is determined as follows 

f{x t+1 \v t ,(Tt+i = 0 , tE 0 | 0 ) 

_ Pr(g t+ i = O|^,^ 0 |o,xt + i)/(xt + i|t/t,x 0 |o) 
Pr(cr t+1 = O\u t ,x 0 \ 0 ) 

(A.18) 

where v t = {oq = 1, <j\ = • • • = cr t = 0}. Let us define 
c = inf{fc|<5fc = 1 A k £ Z{} as the first time-step after the 
last successful transmission where data collision happens, 
otherwise, c = 0. Then we can partition the set v t into several 
mutually exclusive sets as v }, where for every c £ Z\ 

= {ct 0 = 1, <5i = • • • = <5 c _i = 0,{5 C = l,p c = 0}, 

<Jc+i = ■ ■ ■ = cr t = 0} 

and v® = {(Tq = 1, <5i = • • • = <5* = 0}. Therefore, 


Pr(cr t+ i = Q\vt,x 0 \ 0 ,x t+ 1 )f{xt + i\vuX 0 \o) 

t 

= P r Ot+i = 0| i4 , £’ 0 |o, x t+ i)Pr(c = j) f (x t+ i\u J t , £ 0 |o)- 

3= 0 

(A. 19) 

According to the operation mechanism of the CETT policy, 
the predicted state estimation has the following distribution 


f(x t+1 \v 3 t ,x 0 \ 0 ) 


■hf(x t+ i\t, r° +1|t ), if j — o 

otherwise 


where, according to the induction assumption, the mean 
values of all Gaussian terms are equal and determined as 
x t+ i\ t = Ax t | t _i + Bu t and P^ 1|t = AV^A 1 + W is their 
covariance. 

Moreover, based on the triggering policy, after the 
first collision instance which results in two Gaussian 
terms in the pdf of the state estimation, the schedul¬ 
ing policy switches to the purely stochastic policy 
where Pr(<S t+ i = £ 0 |o> *t+i) = 1 ~ Pt+i, Vj ^ 0. How¬ 

ever, following the same procedure as the one for t = 1, 

Pr(cr t+ i = 0\vt,Xo\ o ,x t+ \) = 1 - q t + i 

+ q t+ i^ + ie"’ 2 ‘ T +» n ‘+‘i**‘^ 

where £ t+1 = l/(det(A t+ i'T“ + 1 1 | t + 0” 1 ) det(0)), z t + i = 
x t + 1 - x t+1 \ t and n t+1 | t = Then b Y substi¬ 

tuting the last two expressions into (A.19) and then into (A.18), 
we can arrive at the following 


f{xt +1 \u t ,a t+1 = 0, £ 0 | 0 ) 

_ Ej=o Ei=i 9(i,3,Pt+i,Qt+iW(x t+ i\ t , r tE|t+i) 

1 - Qt+iPt+i 

where g(i,j,p t+ i, qt+i) is a scalar function. Therefore, at 
k = t + 1 the number of Gaussian terms is equal to 2 (t + 1). 


However, the mean of all these terms are equal and not 
affected by the kind of scheduling policy (PST or STETT), 
the triggering probability JH+i, or the collision probability 
q t _ |_i, that is in line with the induction assumption. Therefore, 
x t+ i\ t+ i = x t+ \\ t when <j t +1 = 1 and (13) still holds when 
the scheduler is operating based on CETT policy and the result 
follows. 


E Theorem 3 

Let us consider a single control loop and drop the index i 
for simplicity. Consider TV > 1 as the sampling period for 
every two successive successful transmission, then the average 
control performance (2) can be written as 


N -1 


J “ =tr( - PH ' ) + El(Vj E[ S ,r<Yr ‘ l ‘ )1 


v — 1 




(A.20) 


v—1 


t—0 


which can be established using Wald’s identity as in [11] 
where T“ |t = E[e t \ t eJ lt \x 0 \ 0 ,/ t ] for I t = {o k = 0|fc e Z\} and 
a £ { ps,fij. Given the assumption that both scheduling poli¬ 
cies are triggering with the same probabilities at every time- 
step, to prove Theorem 3, it is just needed to prove 


j; < j; s (A.2i) 

where ./“ = Ylt=i f° r a e {v s iP} an d v £ N> 2 (at 

t = 0 both policies result in the same cost values). For the 
following analysis, let us define 


((0,0), if * = 1 
(m(i),l{ij) = < (1,0), if * = 2 
((0,1), if * = 3 


(A.22) 


where m(i) and l(i) are possible values for the triggering 
variable and the availability of the communication network 
for each user, respectively, which result in no successful data 
transmission to the controller. 

We also need the following technical proposition. 

Proposition 1: Consider 


’fqi(i) = Eleque^Jxoio,^ = m[i),p ? = l(i)] 

as the updated covariance of the state estimation error in the 
scheduler at t = 1 for every transmission epoch with u > 2 
where i £ {1,2,3} and m(i), l(i) follow (A.22). Then 

v—1 

X/ r ( yr t|t)l[ f oi°’ <5 i = m ( i )>Pi = ; (*)1 

t=2 

= tr(L a (^ |1 (f) + 0))+J: 

where L a for a £ {p 7 ps} is a positive definite matrix such 
that < L ps . □ 

The proof of Proposition 1 is provided after the proof of 
Theorem 3. 
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El Proof of Theorem 3: We need to establish (A.21), 
which we shall proof by using induction on v. Suppose v = 2, 
then for every a £ {/x,ps} we have 

Jl = tr(Fr“ |1 ) = E^O^V*))^) 

i= 1 

where 

fi|i(i) =E[e 1 | 1 e{ |1 |x 0 | 0 ,(55 1 = m(i),p i = l(i)], 

S a (i ) =Pr(<5“ = m(i),pi = l(i)) 

for every index value of i £ {1,2, 3} for which m(i) and l(i) 
are defined in (A.22). In Tables I and II we determine the val¬ 
ues of these terms. Consider ro := tr(Y@), r\ := tr(Y$) for 
<I> = AQA 1 — 0 + W, t 2 := an d r 3 := — n — ^ =SL r 2 
which are used in the tables. 


where for a£{p,ps}, jl +l = X^i=i C a (i)S a (i), in 
which C a (i) = tr(Yf“| 1 (z)) + tr(L 0 T'“| 1 (i)). Then let 
us denote si := tr(L ps <I>), s 2 := tr(L M $)/(l + Ai), 
S 3 : = ^(! - and s 4 := tr (L^). Based 

on Proposition 1 

s 2 < s 4 < si. (A.23) 

In Tables III and IV the values of C a {i ) are given for the 
CETC and the PST policies, respectively. 


Table III: Performance terms of the CETC policy 


i 

s- 

pi 

SAi) 

CAi) 

i 

0 

0 

(1 - 90(1 -Pi) 

ro + r 2 + s 2 

2 

1 

0 

(1 - 90P1 

ro + r 3 + s 3 

3 

0 

1 

91 (1 ~ Pi) 

ro + r 2 + S 2 


Table I: Performance terms of the CETC policy when v = 2 


i 


Pi 



i 

0 

0 

(1 - <Jl)(l - pi) 

ro + r 2 

2 

1 

0 

(1 - 9i)pi 

ro + r 3 

3 

0 

1 

91 (1 ”Pl) 

ro + r 2 


Table II: Performance terms of the PST policy when v = 2 


i 

<5f 

Pi 

gps 

tr(yp^) 

1,2 

0 or 1 

0 

(! - 91) 

ro + n 

3 

0 

1 

91 (1 — Pi) 

ro + ri 


As an example, for the CETC policy when i = 2, we have 
<5f = 1 and pi = 0. This condition results in a collision in 
which the updated state error covariance is determined as 

r y[i = 2i = e + i(.-f^)* 

and by substitution into tr(YT , | '| 1 ), it results in ro + r 3 as 
written in Table I. However, for the PST policy, in case the 
network is not available, the updated state error covariance is 
r^|i | [i = 1, 2] = 0 + $ which results in ro + r\ as the control 
performance. Similar reasonings can be applied to the other 
situations. 

Note that 

Jl = (! - Qi)(ro + ri) + gi(l - pi)(r 0 + r 2 ), 

Jls = (! - <?i)( r o + ft) + <7i(l - Pi)(ro + ri) 

from which it is clear that J 2 < J 2 S since r 2 < r-|. 

Now assume (A.21) holds for v = z, i.e. J< Jf s , then we 
should prove the same inequality for v = z 4 1. We have 

J z a +l = E + Etr(Fr“| t )|[z])5 a (z) 

i =1 t =2 

and by using Proposition 1, we can simplify this equation as 
follows 

3 

Ja +1 = E htW) + *(£<>(*?|l(i) + 0 )) + Ja)S a (i)i 

i—1 

Therefore, by considering the assumption of induction, in or¬ 
der to prove J* +1 < Jpf 1 * we just need to prove j* +1 < j** 1 


Table IV: Performance terms of the PST policy 


i 


Pi 

SP s (f 

cp s {f) 

i 

0 or 1 

0 

(! - 9i) 

ro + r-i + si 

2 

0 

1 

9l(l — Pi) 

^0+^1+ si 


Then we have 

Jl +1 = (1 — 9i)( r o + r i + s 4 ) + 9t(l — Pt)(ro + r 2 + s 2 ), 

jpt 1 = (1 - 9i)(fo + ri + si) + qi{l~Pi){r 0 + n + si) 

and by using the inequalities given in (A.23) and r 2 < r \ we 
can infer j^ +1 < jiff 1 which concludes the proof. 

E2 Proof of Proposition 1: This proposition actually 
considers the propagation of the first time-step’s state es¬ 
timation error covariance in the future time-steps during 
every transmission epoch. We know that when cr t = 0 for 
t £ Z'l^ 1 during every transmission epoch, e t \ t = e t \t + £t\t 
where e t \t ~ A/"(0,0). 

For the PST policy, we know that e 4 | 4 ~ A/"(0, + 0) 

which will increase the covariance of the future errors as 
+ 0)A t_1 for all t £ Z. Therefore, the total 
amount of increase of the cost function during every transmis¬ 
sion epoch due to the first time-step state estimation error will 
be 5J ps = YZjZ 2 tr(A TJ_1 YA J_1 ('I'jj f 1 + 0)) which results 
in L ps = A J jYAf Now let us consider the CETC 
policy and denote 

f(l -P)» >2 if Z < 0 

P(l) = l 1 - (1 ~p )1+ " , if 1 = 0 

I 1, otherwise. 

Suppose that at t = 1 the first collision has occurred. 
Then from the next time-step, the scheduler follows the 
PST policy where the increase in the value of the covari¬ 
ance will be as the one obtained for the PST policy, i.e. 
SJp =1 = ZZjZl fr(A TJ,-1 Y-A J ' -1 (\f r 5 i | 1 + 0)) which results in 
L “ =1 = i A T:7 Y A 7 . Now suppose that collision occurs at 
t = 2, then we can show that = ZZZjZi /3(0)A TJ YA 77 and 
if collision occurs at t = k > 2, then 

v—22-k+j-l 

L r fe = E n vm jj YAi. 

j =1 1=2—k 
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Therefore, 

v —1 v —2 

^ = 5>(i - = y *m ij YAi 

fc=i i=i 

where a(j) = p + Efe^M 1 “ P) fc ^ £(0- In or¬ 

der to prove L p < L ps , it is just needed to prove a(j) < 1 
for all j £ Z'Y For an arbitrary j we have 

a(j ) = P + /3{0)p{l -p) + /5(—l)/?(0)p(l - p) 2 + ■ • • 

+ /3(-l)' 7_ V(0)p(l - p)° + P(-l) J p(l - p) J+1 
H- \-P(-l) 3 p(l ~P) V ~ 2 = ••• 

= 1 - (1 -p) v+ %~ 2 < 1 

which concludes our statement and proves the proposition. 


F Lemma 4 

We know rf = p l YVj=i ~ P)* t ^ len 

mm m 

£W) = £( l °g(p , ') c, + E iog(i-pE) 

j =i i =1 

m 

= E log -p i ) s: "= i ^ c ‘) 

f=i 

which indicates that 

p4* = arg max (p 3 ') 0 ’(1 — p J ')^-' i=1 > i ^ c 

£ [ 0 , 1 ] 

which results in (28). 
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